Pitch-synchronous time-scaling for high-frequency excitation regeneration
نویسندگان
چکیده
The goal of bandwidth extension of speech (BWE) is to extrapolate the missing low or high frequency components of the wide-band speech (50-8000 Hz) based entirely on information contained in a narrow-band signal (300-3400 Hz). In this paper we propose a new method for high-frequency regeneration of the excitation signal, using the correlation between the shape of the glottal flow waveform and the spectrum of the voice source. The high-band excitation is generated by performing a pitch-synchronous time-scale (PSTS) transformation on the linear prediction narrow-band residual to generate an high-pass signal that retains the periodic characteristics of the original signal but with a larger open quotient. This method is easy to implement and does not introduce discontinuities in the spectrum of the regenerated excitation. It can be used in applications for BWE where no side information is transmitted or for low bit coding of wide-band speech.
منابع مشابه
Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals
Timeand pitch-scale modifications of speech signals find important applications in speech synthesis, playback systems, voice conversion, learning/hearing aids, etc.. There is a requirement for computationally efficient and real-time implementable algorithms. In this paper, we propose a high quality and computationally efficient timeand pitch-scaling methodology based on the glottal closure inst...
متن کاملHigh-Quality Speech Modification Based on Pitch- Synchronous Harmonic and Non-harmonic Modeling of Speech
In this paper, we propose a high-quality speech modification method based on pitch-synchronous harmonic and non-harmonic modeling of speech. In the proposed method, the harmonic and non-harmonic parts of speech are modeled by the sum of sinusoids with frequencies corresponding to pitch multiples and with randomized frequencies, respectively. Then, harmonic and nonharmonic parts are synthesized ...
متن کاملIssues in high quality LPC analysis and synthesis
This paper deals with careful non-real-time LPC analysis. A baseline system is first described. lt uses a pitch-synchronous covariancemethod analysis with a laryngograph signal providing the pitch synchrony. Work to improve the voicing decision and F0 determination and to find a better voiced excitation waveform is described. Setting a lower Iimit on the value of B 1 is found to be useful. Buzz...
متن کاملTime -frequency analysis of vocal source signal for speaker recognition
This paper investigates the importance of spectrotemporal characteristics of the source excitation signal for speaker recognition. We propose an effective feature extraction technique for obtaining essential timefrequency information from the linear prediction (LP) residual signal, which are closely related to the glottal excitation of individual speaker. With pitch synchronous analysis, wavele...
متن کاملPitch-synchronous time-scaling for prosodic and voice quality transformations
Current time-domain pitch modification techniques have well known limitations for large variations of the original fundamental frequency. This paper proposes a technique for changing the pitch and duration of a speech signal based on time-scaling the linear prediction (LP) residual. The resulting speech signal achieves better quality than the traditional LP-PSOLA method for large fundamental fr...
متن کامل